WHAT IS CLAIMED IS: 



A method of generating a wrapper for extracting information from a 



)sito^ ®f semistructured information, the method comprising the computer- 
implemented^steps of: 

I presenting to a user a set of predefined operation types from which to 
construct the wrapper, the predefined set comprising operation types for 
extracting data from the repository based on structural relationships within the 
semistruqtured information; 

prelsenting to the user an option to select operation types from the set and 
to specify properties of selected: operation types, to allow the user to define 
operations to be performed by the wrapper; 

moniroring and recording selections by the user of operation types and 
entry of operation properties, to generate the wrapper; and 

generating and displaying on a display screen a graphical representation 
of the wrapper i^ which wrapper operations are depicted as objects on the 
display screen. 

2. The methoti of Claim 1, further comprising presenting to the user an 
option to select operation^ depicted ^n^the display screen to allow the user to define 
links between operations, wl^erein a link represents a call of a destination operation by a 
source operation. 

3. The method of\Claim 2, wherein links are graphically depicted on the 
display screen. 

4. The method of Cl^imS.,, wherein the repository comprises a Web site. 

5. The method of Clauii 4, wherein the set of predefined operation types 
provides the capability of matching ^regular expression. 

6. The method of Claim ^, wherein the generated wrapper is configured to 
output extracted data in a structured fomat. 

7. The method of Claim 6, mrther comprising visually depicting execution 
of the wrapper through the graphical representation of the wrapper. 

8. The method of Claim 7, finjher comprising displaying output of the 
wrapper. 
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9. The methoa of Claim 8, further comprising displaying data extracted 
from the semistructured information by the wrapper. 

10. The method\of Clain?:^7" further comprising performing a statistical 
analysis on the output of the wrapper. 

5 1 1 . A method of generating a \^^apper for extracting data from a Web site, 

the method comprising: 

invoking an appliJ^ation, the application providing a graphical design 
environment in which the wrapper can be graphically represented; 

using the applicatioAto create objects in the graphical design 
10 environment, the objects representing operations for extracting data from a Web 

site based on structural relationships within the Web site; and 

using the application to create links between the objects in the graphical 
design environment, wherein a link represents a call of a destination operation 
by a source operation, and whereinythe wrapper comprises the operations and the 
15 calls. 

12. The method of Claim 11, fikther comprising using the application to 
debug the wrapper. -^"^ \ 

13. The method of Claim 12, furthericomprising: 
using the application to-^xecute the wrapper; and 

20 using the application to display the\data extracted from the Web site in a 

structured format. 

14. The method of Claim H^^fiirther cbmprising using the application to 
examine the Web site to identify information of interest. 

15. The method of Claim 11, further comprrsing defining a structured format 
25 in which the wrapper can output the information. 

16. A computer program for faciUtating th^ design and creation of a 
wrapper, the computer program comprising, on a computer readable medium: 

wrapper editor code which provides a graphical design environment ifT 
which a wrapper can be graphically constructed; 
30 wrapper rendering code which displays a graphical representation of the 

wrapper within the graphical design environment as the wrapper is constructed; 
and 
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operation class code which defines a set of operation classes for 
extracting data from semistructured information based on structural relationships 
within the semistructured information, wherein the wrapper comprises instances 
of the operation classes. 

5 17. The computer program of Claim ^IG^^wherein the semistructured 

information comprises a Web site. 

18. The computer program of Claim IT^ferther comprising table display 
code which displays infomlation extracted from the Web site. 

19. The computer program of Claim 18, further comprising wrapper run 
10 code which enables the execution of the wrapper. ^ 

20. The computer^ program of Claim 19, fixrther comprising wrapper 
i«i debugging code. 

21. The computer program of Claim 20, wherein the set of operation classes 
m enable the structuring of extracted data. 

P 15 22. The computer program of Claim 21, wherein the wrapper debugging 

T: \ 

r-^ code provides options for controlling the execution of the wrapper, the options 

comprising starting the wrapper, stopping the wrapper, and setting breakpoints. 
:^ 23. The computer program of Claim 22, wherein the wrapper debugging 

II- code provides options for displaying transitory data" produced by the wrapper during 

20 execution. \ 

24. The computer program or\ Claim 22, fiirther comprising Web viewer 
code which displays Web pages. 

25. A method of executing a wrapper, the method comprising: 
creating an instance of a wrapper within an internal memory of a 

25 computer, the wrapper facilitating the automated extraction of information from 

a Web site; 

providing a graphical representatioh of the wrapper on a display screen; 
providing a user interface through w)jich a user can control the execution 
of the wrapper; 

30 monitoring the user interface for user ifiput; 

controlling the execution the wrapper inVesponse to user input; 
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graphically depicting the execution of the wrapper through the graphical 

1 ' 

representation of the^wrapper; and 

displaying in^rmation extracted from the Web site. 

26. The method of Claim 25, further comprising: 

graphically deputing objects on the display screen, the objects 
representing operations to |>r^formed by the wrapper; and 

graphically depicting links between objects on the display screen, 
wherein a link represents a call of a destination operation by a source operation. 

27. The method of Claim 26^wherein the user interface provides to the user 
options for controlling the executipn of the wrapper, the . options comprising starting the 
wrapper, stopping the wrapper, ana setting breakpoints at operations. 

28. The method of Clami further comprising performing a statistical 
analysis on output of the wrapper. \ 

29. The method of Claim\ 27, wherein the wrapper is configured to output 
extracted information in a structured foilnat. 
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